Abstract. The rank of the matrix multiplication operator for $n \times n$ matrices is one of the most
studied quantities in algebraic complexity theory. We prove new lower bounds that imply the
rank is at least $3n^2 - 4n^{3/2} + n$. When $n$ is an even square, the bound is $3n^2 - 4n^{3/2} + 3n$. The
previous lower bound, due to Bläser [1], was $\frac{5}{2}n^2 - 3n$. The new bounds improve Bläser's bound
for all $n > 37$. We also prove lower bounds for rectangular matrices and include remarks about
the geometry of the matrix multiplication operator that may be of interest in their own right.



1. Introduction 



Let $X = (x^i_j)$, $Y = (y^i_j)$ be $n \times n$-matrices with indeterminate entries. The rank of matrix
multiplication, denoted $R(M_{\langle n,n,n\rangle})$, is the smallest number $r$ of products $p_\rho = u_\rho(X) v_\rho(Y)$,
where $u_\rho, v_\rho$ are linear forms, such that the entries of the matrix product $XY$ are contained in
the linear span of the $p_\rho$. This quantity is also called the bilinear complexity of $n \times n$ matrix
multiplication. More generally, one may define the rank $R(B)$ of any bilinear map $B$, see §2.
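
To make the definition concrete, here is a minimal Python sketch (an illustration only, not part of the argument; all function names are chosen for this example). It checks Strassen's classical scheme for $n = 2$, in which seven products of linear forms span the four entries of $XY$, so that $R(M_{\langle 2,2,2\rangle}) \leq 7$.

```python
from itertools import product

def strassen_products(X, Y):
    """Seven products p_rho = u_rho(X) * v_rho(Y), each a product of two linear forms."""
    (a, b), (c, d) = X
    (e, f), (g, h) = Y
    return [
        (a + d) * (e + h),
        (c + d) * e,
        a * (f - h),
        d * (g - e),
        (a + b) * h,
        (c - a) * (e + f),
        (b - d) * (g + h),
    ]

def product_via_strassen(X, Y):
    """Entries of XY written as linear combinations of the seven products."""
    p1, p2, p3, p4, p5, p6, p7 = strassen_products(X, Y)
    return [[p1 + p4 - p5 + p7, p3 + p5],
            [p2 + p4, p1 - p2 + p3 + p6]]

def naive_product(X, Y):
    """Entries of XY from the definition (eight scalar multiplications)."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def agrees_on_grid(values):
    """Compare both formulas on all pairs of 2x2 matrices with entries in `values`."""
    for a, b, c, d, e, f, g, h in product(values, repeat=8):
        X, Y = [[a, b], [c, d]], [[e, f], [g, h]]
        if product_via_strassen(X, Y) != naive_product(X, Y):
            return False
    return True

print(agrees_on_grid([-1, 0, 2]))  # prints True
```

The check only confirms the upper bound given by this particular decomposition; it says nothing about lower bounds, which are the subject of this paper.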

From the point of view of geometry, rank is badly behaved as it is not semi-continuous.
Geometers usually prefer to work with the border rank of matrix multiplication, which fixes the
semi-continuity problem by fiat: the border rank of a bilinear map $B$, denoted $\underline{R}(B)$, is the
smallest $r$ such that $B$ can be approximated to arbitrary precision by bilinear maps of rank $r$.
By definition, one has $R(B) \geq \underline{R}(B)$.

In [5] G. Ottaviani and I gave new lower bounds for the border rank of matrix multiplication,
namely $\underline{R}(M_{\langle n,n,n\rangle}) \geq 2n^2 - n$. Those results are used here to prove:

Theorem 1.1. Let $p < n$ be a natural number. Then

$$R(M_{\langle n,n,n\rangle}) \geq \left(3 - \frac{1}{p+1}\right) n^2 - (4p + 1)n.$$

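This bound is maximized when $p = \lfloor \frac{\sqrt{n}}{2} - 1 \rfloor$ or $p = \lceil \frac{\sqrt{n}}{2} - 1 \rceil$, so if $\frac{\sqrt{n}}{2} \in \mathbb{Z}$,

$$R(M_{\langle n,n,n\rangle}) \geq 3n^2 - 4n^{3/2} + 3n.$$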
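
As a sanity check on the arithmetic, the following Python sketch evaluates the right-hand side of Theorem 1.1 exactly, finds the best $p$ by brute force, and compares the result with the $\frac{5}{2}n^2 - 3n$ bound of [1]. The function names are hypothetical and chosen only for this illustration.

```python
from fractions import Fraction

def theorem_1_1_bound(n, p):
    """Right-hand side of Theorem 1.1: (3 - 1/(p+1)) n^2 - (4p+1) n."""
    return (3 - Fraction(1, p + 1)) * n * n - (4 * p + 1) * n

def previous_bound(n):
    """The earlier lower bound (5/2) n^2 - 3n from [1]."""
    return Fraction(5, 2) * n * n - 3 * n

def best_p(n):
    """A natural number 1 <= p < n maximizing the Theorem 1.1 bound (brute force)."""
    return max(range(1, n), key=lambda p: theorem_1_1_bound(n, p))

n = 64  # an even square: sqrt(n)/2 = 4 is an integer
p = best_p(n)
print(p, theorem_1_1_bound(n, p), previous_bound(n))
```

For $n = 64$ the maximizing value is $p = 3 = \frac{\sqrt{n}}{2} - 1$, and the value of the bound agrees with $3n^2 - 4n^{3/2} + 3n$.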

These bounds extend to partially rectangular matrix multiplication. Let $M_{\langle n,m,l\rangle}$ denote the
multiplication of an $n \times m$ matrix by an $m \times l$ matrix.

Theorem 1.2. Let $p < n$ be a natural number. Then

$$R(M_{\langle n,n,m\rangle}) \geq \left(1 + \frac{p}{p+1}\right) nm + n^2 - (4p + 1)n.$$

For example, if $\frac{\sqrt{n}}{2} \in \mathbb{Z}$, taking $p = \frac{\sqrt{n}}{2} - 1$ gives

$$R(M_{\langle n,n,m\rangle}) \geq 2nm + n^2 - 2\sqrt{n}\,(m + n) + 3n.$$

The previous bound, due to Bläser [2], was $R(M_{\langle n,n,m\rangle}) \geq 2nm - m + 2n - 2$.
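
For the reader's convenience, the substitution $p = \frac{\sqrt{n}}{2} - 1$ into the bound of Theorem 1.2 can be spelled out as follows. This is a routine verification added for illustration.

```latex
\begin{align*}
p + 1 = \tfrac{\sqrt{n}}{2}
  \quad\Longrightarrow\quad
  \frac{p}{p+1} &= 1 - \frac{2}{\sqrt{n}}, \qquad 4p + 1 = 2\sqrt{n} - 3, \\
\Bigl(1 + \frac{p}{p+1}\Bigr) nm + n^2 - (4p + 1)n
  &= \Bigl(2 - \frac{2}{\sqrt{n}}\Bigr) nm + n^2 - (2\sqrt{n} - 3)\,n \\
  &= 2nm - 2\sqrt{n}\,m + n^2 - 2\sqrt{n}\,n + 3n \\
  &= 2nm + n^2 - 2\sqrt{n}\,(m + n) + 3n .
\end{align*}
```

Setting $m = n$ recovers the bound $3n^2 - 4n^{3/2} + 3n$ for square matrices.
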

Remark 1.3. There is no known nontrivial lower bound on rank in terms of border rank for
matrix multiplication or tensors in general. Regarding upper bounds, if $T$ is a tensor of border
rank $r$, where the approximating curve of rank $r$ tensors limits in such a way that $q$ derivatives
of the curve are used, then the rank of $T$ is at most $(2q - 1)r$, see [3, Prop. 15.26].

The language of tensors will be used throughout. In §2 the rank question and matrix
multiplication are rephrased in the language of tensors. In §3 the equations used to obtain lower
bounds for border rank in [5] are expressed in a manner suitable for the proof of Theorem 1.2,
which is proved in §4. I work over the complex numbers throughout.

2. Matrix multiplication and its rank 

Let $A, B, C$ be vector spaces, of dimensions $a, b, c$ and with dual spaces $A^*, B^*, C^*$. That is,
$A^*$ is the space of linear maps $A \to \mathbb{C}$. Write $A^* \otimes B$ for the space of linear maps $A \to B$ and
$A^* \otimes B^* \otimes C$ for the space of bilinear maps $A \times B \to C$. To avoid extra $*$'s, I work with bilinear
maps $A^* \times B^* \to C$, i.e., elements of $A \otimes B \otimes C$. Let $T : A^* \times B^* \to C$ be a bilinear map. One
may also consider $T$ as a linear map $T : A^* \to B \otimes C$ (and similarly with the roles of $A, B, C$
exchanged), or as a trilinear map $A^* \times B^* \times C^* \to \mathbb{C}$.

The rank of a bilinear map $T : A^* \times B^* \to C$, denoted $R(T)$, is the smallest $r$ such that there
exist $a_1, \dots, a_r \in A$, $b_1, \dots, b_r \in B$, $c_1, \dots, c_r \in C$ such that $T(\alpha, \beta) = \sum_{i=1}^r a_i(\alpha) b_i(\beta) c_i$ for all $\alpha \in A^*$
and $\beta \in B^*$. Equivalently (see e.g., [4, §3.1]), considering the map $T : A^* \to B \otimes C$, $R(T)$ is
the smallest number of rank one elements of $B \otimes C$ needed to span a linear space containing the
linear space $T(A^*)$.

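Let $M_{\langle m,n,l\rangle} : \mathrm{Mat}_{m \times n} \times \mathrm{Mat}_{n \times l} \to \mathrm{Mat}_{m \times l}$ denote the matrix multiplication operator. Write
$M = \mathbb{C}^m$, $N = \mathbb{C}^n$ and $L = \mathbb{C}^l$. Then

$$M_{\langle m,n,l\rangle} : (N \otimes L^*) \times (L \otimes M^*) \to N \otimes M^*$$

has the interpretation as $M_{\langle m,n,l\rangle} = \mathrm{Id}_N \otimes \mathrm{Id}_M \otimes \mathrm{Id}_L \in (N^* \otimes L) \otimes (L^* \otimes M) \otimes (N \otimes M^*)$, where $\mathrm{Id}_N \in
N^* \otimes N$ is the identity map. If one thinks of $M_{\langle m,n,l\rangle}$ as a trilinear map $(N \otimes L^*) \times (L \otimes M^*) \times
(N \otimes M^*) \to \mathbb{C}$, in bases it is $(X, Y, Z) \mapsto \mathrm{trace}(XYZ)$. If one thinks of $M_{\langle m,n,l\rangle}$ as a linear map
$N \otimes L^* \to (L^* \otimes M) \otimes (N \otimes M^*)$ it is just the identity map tensored with $\mathrm{Id}_M$. In particular, if
$\alpha \in N \otimes L^*$ is of rank $q$, its image, considered as a linear map $L \otimes M^* \to N \otimes M^*$, is of rank $qm$.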

Returning to general tensors $T \in A \otimes B \otimes C$, from now on assume $b = c$. When $T = M_{\langle m,n,l\rangle}$,
one has $A = N^* \otimes L$, $B = L^* \otimes M$, $C = N \otimes M^*$, so $b = c$ is equivalent to $l = n$. For any tensor
$T \in A \otimes B \otimes C$, if there exists $\alpha \in A^*$ such that $T(\alpha)$ is of maximal rank $b$, then one may use the
linear map $T(\alpha) : B^* \to C$ to identify $C \cong B^*$, and consider $T(A^*) \subset B \otimes B^*$ as a subspace of the
space of linear maps $B \to B$.

Applying this to matrix multiplication with $l = n$, $M_{\langle n,n,m\rangle} \in (N^* \otimes L) \otimes (L^* \otimes M) \otimes (N \otimes M^*)$, a
choice of $\alpha$ also allows one to identify $L \cong N$, which, letting $\mathfrak{gl}(N)$ denote the algebra of linear
maps $N \to N$, induces a map

(1) $M_A : \mathfrak{gl}(N) \to \mathfrak{gl}(B).$
It follows immediately that: 

Key Observation 2.1. The map (1) is an inclusion of Lie algebras. 

That the rank of a tensor can be studied via a subspace of endomorphisms was the idea
behind Strassen's equations for border rank [7]. He observed that if $a = 3$, and one takes a basis
$\alpha^0, \alpha^1, \alpha^2$ of $A^*$ such that $T(\alpha^0)$ has maximal rank $b$ (this implies $\underline{R}(T) \geq b$), then $\underline{R}(T) = b$ if
and only if $T(\alpha^1), T(\alpha^2)$, considered as endomorphisms, commute. More generally, he showed
that the border rank is bounded below by $b$ plus half the rank of their commutator considered
as a linear map, $[T(\alpha^1), T(\alpha^2)] : B \to B$. To see the first assertion, note that if $T(\alpha^1), T(\alpha^2)$
commute and are diagonalizable, then they are simultaneously diagonalizable, and each is a
linear combination of the $b$ rank one elements on the diagonal.

It follows from these remarks that Strassen's commutator $[M(\alpha^1), M(\alpha^2)]$ of $b \times b$ matrices
(where $b = nm$), has rank equal to $m$ times the rank of the commutator of $n \times n$ matrices
$[\alpha^1, \alpha^2]$. In particular, for generic $\alpha^1, \alpha^2$, it will be of maximal rank. In general, the Key
Observation allows one to transport questions about expressions in commutators in $\mathfrak{gl}(B) = \mathfrak{gl}_{n^2}$
to expressions in commutators in $\mathfrak{gl}(N) = \mathfrak{gl}_n$.
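
The rank statement can be checked numerically. The sketch below is an illustration under an assumption: it models $M(\alpha)$ as the Kronecker product $\alpha \otimes \mathrm{Id}_m$, with an index ordering chosen here for convenience. It verifies, with exact rational arithmetic, that the commutator of the enlarged matrices is the enlarged commutator and that its rank is $m$ times the rank of $[\alpha^1, \alpha^2]$. All function names are hypothetical.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def commutator(A, B):
    """The commutator [A, B] = AB - BA."""
    AB, BA = matmul(A, B), matmul(B, A)
    return [[x - y for x, y in zip(r, s)] for r, s in zip(AB, BA)]

def kron_with_identity(A, m):
    """Matrix of A tensor Id_m, of size (n*m) x (n*m), with index (i, k) -> i*m + k."""
    n = len(A)
    return [[A[i // m][j // m] if i % m == j % m else 0 for j in range(n * m)]
            for i in range(n * m)]

def rank(A):
    """Rank over the rationals, by Gaussian elimination with exact arithmetic."""
    M = [[Fraction(x) for x in row] for row in A]
    r = 0
    for c in range(len(M[0])):
        pivot = next((i for i in range(r, len(M)) if M[i][c] != 0), None)
        if pivot is None:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, len(M)):
            factor = M[i][c] / M[r][c]
            M[i] = [x - factor * y for x, y in zip(M[i], M[r])]
        r += 1
        if r == len(M):
            break
    return r

a1 = [[0, 1], [0, 0]]
a2 = [[0, 0], [1, 0]]
m = 3
big = commutator(kron_with_identity(a1, m), kron_with_identity(a2, m))
print(rank(big), m * rank(commutator(a1, a2)))
```

In this example $[\alpha^1, \alpha^2]$ is the diagonal matrix with entries $1, -1$, which has maximal rank, so the enlarged commutator has maximal rank as well.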



3. The equations of [5] in coordinates 

Strassen's equations were rephrased in [6] as follows: given $T \in A \otimes B \otimes C$, consider $T \otimes \mathrm{Id}_A \in
A \otimes B \otimes C \otimes A \otimes A^* = A^* \otimes B \otimes A \otimes A \otimes C$, and its skew-symmetrization in the $A \otimes A$ factor, $T_A^{\wedge 1} \in
A^* \otimes B \otimes \Lambda^2 A \otimes C$, which we think of as a linear map $A \otimes B^* \to \Lambda^2 A \otimes C$. If $a = 3$ and $b = c$,
the corresponding map expressed in bases is a $3b \times 3b$ matrix. If $a_0, a_1, a_2$ is a basis of $A$
and one chooses bases of $B, C$, then elements of $B \otimes C$ may be written as matrices, and $T =
a_0 \otimes X_0 + a_1 \otimes X_1 + a_2 \otimes X_2$, where the $X_j$ are size $b$ square matrices. With appropriate ordering,
the corresponding matrix for $T_A^{\wedge 1}$ is

$\mathrm{Mat}(T_A^{\wedge 1})$

Now assume $X_0$ is invertible and change bases such that it is the identity matrix. Recall the
formula for block matrices

(2) $$\det \begin{pmatrix} X & Y \\ Z & W \end{pmatrix} = \det(W) \det(X - Y W^{-1} Z),$$

assuming $W$ is invertible. Then, using the $(b, 2b)$ blocking (so $X = 0$ in (2))

$$\det \mathrm{Mat}(T_A^{\wedge 1}) = \det(X_1 X_2 - X_2 X_1) = \det([X_1, X_2]),$$

showing the equivalence of the two perspectives for the highest order Strassen equation, which
is the only relevant one for this paper.
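
The determinant identity can be tested on small examples. The sketch below rests on an assumption made only for this illustration: $\mathrm{Mat}(T_A^{\wedge 1})$ is taken to be the skew block matrix with rows $(0, X_2, -X_1)$, $(-X_2, 0, X_0)$, $(X_1, -X_0, 0)$, which is one common arrangement. With that choice, formula (2) gives $\det \mathrm{Mat}(T_A^{\wedge 1}) = (-1)^b \det([X_1, X_2])$ when $X_0$ is the identity; the sign does not affect whether the determinant vanishes. All function names are hypothetical.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def commutator(A, B):
    """The commutator [A, B] = AB - BA."""
    AB, BA = matmul(A, B), matmul(B, A)
    return [[x - y for x, y in zip(r, s)] for r, s in zip(AB, BA)]

def det(A):
    """Determinant by Gaussian elimination with exact rational arithmetic."""
    M = [[Fraction(x) for x in row] for row in A]
    n, sign, result = len(M), 1, Fraction(1)
    for c in range(n):
        pivot = next((i for i in range(c, n) if M[i][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            sign = -sign
        result *= M[c][c]
        for i in range(c + 1, n):
            factor = M[i][c] / M[c][c]
            M[i] = [x - factor * y for x, y in zip(M[i], M[c])]
    return sign * result

def neg(A):
    return [[-x for x in row] for row in A]

def identity(b):
    return [[1 if i == j else 0 for j in range(b)] for i in range(b)]

def block_matrix(blocks):
    """Assemble a matrix from a square grid of equally sized square blocks."""
    b = len(blocks[0][0])
    return [[blocks[I][J][i][j] for J in range(len(blocks)) for j in range(b)]
            for I in range(len(blocks)) for i in range(b)]

def skew_block_matrix(X0, X1, X2):
    """Assumed form of Mat(T_A^{wedge 1}); the arrangement is an assumption of this sketch."""
    Z = [[0] * len(X0) for _ in X0]
    return block_matrix([[Z, X2, neg(X1)],
                         [neg(X2), Z, X0],
                         [X1, neg(X0), Z]])

X1 = [[1, 2], [3, 4]]
X2 = [[0, 1], [1, 1]]
print(det(skew_block_matrix(identity(2), X1, X2)), det(commutator(X1, X2)))
```

For even $b$, as in this example, the two determinants agree exactly.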

Strassen's equations were generalized in [5] and the generalization was used to prove new
lower bounds for the border rank of matrix multiplication. Let $a = 2p + 1$. Consider $T \otimes \mathrm{Id}_{\Lambda^p A} \in
A \otimes B \otimes C \otimes \Lambda^p A \otimes \Lambda^p A^*$ and its skew-symmetrization to

(3) $$T_A^{\wedge p} : \Lambda^p A \otimes B^* \to \Lambda^{p+1} A \otimes C.$$

By the choices of dimensions, this is a linear map between vector spaces of the same dimension
$\binom{2p+1}{p} b$. The result is that if the border rank of $T$ is at most $r$, then the rank of the linear map
$T_A^{\wedge p}$ is at most $r \binom{2p}{p}$.

Writing $T = a_0 \otimes X_0 + \cdots + a_{2p} \otimes X_{2p}$, the expression of (3) in appropriate bases is of the form

(4) $$\begin{pmatrix} 0 & Q \\ \tilde{Q} & R \end{pmatrix}$$

where this matrix is blocked $\left(\binom{2p}{p+1} b, \binom{2p}{p} b\right) \times \left(\binom{2p}{p+1} b, \binom{2p}{p} b\right)$,
$R$ has entries in blocks consisting of $X_0$ and zero,
$Q$ has entries in blocks consisting of $X_1, \dots, X_{2p}$ and zero, and $\tilde{Q}$ is the block transpose of $Q$
except that if an index is even, the block is multiplied by $-1$ (blocks with odd indices stay the
same).

For example, when $p = 2$, $Q$ is a block matrix whose nonzero blocks are $X_1, \dots, X_4$ up to sign.

Note that if $X_0$ is the identity matrix, the determinant of (4) when $p = 2$ equals the determinant of

(5) $$\begin{pmatrix}
0 & [X_1, X_2] & [X_1, X_3] & [X_1, X_4] \\
[X_2, X_1] & 0 & [X_2, X_3] & [X_2, X_4] \\
[X_3, X_1] & [X_3, X_2] & 0 & [X_3, X_4] \\
[X_4, X_1] & [X_4, X_2] & [X_4, X_3] & 0
\end{pmatrix}.$$



In general, when $X_0$ is the identity matrix, the determinant of (4) equals the determinant of
the block $2pb \times 2pb$ matrix whose $(i,j)$-th block entry is $[X_i, X_j]$.

Remark 3.1. It would be interesting to have direct geometric interpretations of the hypersurface
in the Grassmannian $G(2p, \mathfrak{sl}(B))$ of $2p$-dimensional subspaces of the traceless endomorphisms
$\mathfrak{sl}(B)$ that satisfy the equation that, if one takes a basis $X_1, \dots, X_{2p}$ of the subspace, the
determinant of the matrix whose block entries are $[X_i, X_j]$ is zero. The derivation shows that
the vanishing of the determinant is independent of our choice of basis.



4. Proof of Theorem 1.2 

Recall the following standard lemma (see, e.g., [4, §11.5]): 

Lemma 4.1. Let $U$ be a vector space, let $P$ be a polynomial on $U$ of degree $d$. Let $u_1, \dots, u_{\dim U}$
be a basis of $U$. Then there exists a subset $u_{i_1}, \dots, u_{i_s}$ of cardinality $s \leq d$ such that $P|_{\langle u_{i_1}, \dots, u_{i_s} \rangle}$
is not identically zero.

Lemma 4.1 says that a quadric surface in $\mathbb{P}^3$ cannot contain three lines whose pairwise
intersections span $\mathbb{P}^3$, and more generally, a hypersurface of degree $d$ in $\mathbb{P}U$ cannot contain the $\binom{\dim U}{d}$
$\mathbb{P}^{d-1}$'s of intersection of a collection of hyperplanes whose dual points span $\mathbb{P}U^*$.
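
A small brute-force illustration of Lemma 4.1 follows; it is a sketch, not the proof, and its function names are chosen for this example. It takes a polynomial given as a Python function in coordinates with respect to the basis and searches for a set of at most $d$ basis vectors on whose span the polynomial does not vanish identically. It uses the standard fact that a nonzero polynomial of degree at most $d$ in each variable cannot vanish on the whole grid $\{0, \dots, d\}^s$.

```python
from itertools import combinations, product

def vanishes_on_span(P, dim, subset, degree):
    """True if P is identically zero on the span of the basis vectors indexed by `subset`.

    Only grid points with coordinates in {0, ..., degree} are tested, which
    suffices for a polynomial of degree at most `degree`.
    """
    for coeffs in product(range(degree + 1), repeat=len(subset)):
        point = [0] * dim
        for i, c in zip(subset, coeffs):
            point[i] = c
        if P(point) != 0:
            return False
    return True

def small_witness(P, dim, degree):
    """First subset of at most `degree` basis vectors on whose span P is not zero."""
    for s in range(degree + 1):
        for subset in combinations(range(dim), s):
            if not vanishes_on_span(P, dim, subset, degree):
                return subset
    return None

# A quadric on a 4-dimensional space: it vanishes on every coordinate axis,
# but not on the plane spanned by the first two basis vectors.
quadric = lambda x: x[0] * x[1] + x[2] * x[3]
print(small_witness(quadric, 4, 2))  # prints (0, 1)
```

In the proof of Lemma 4.3 the same principle is applied to $\det_n$ and to a polynomial of degree $4pn$, where an exhaustive search of this kind would of course not be practical.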

Let $G(k, A)$ denote the Grassmannian of $k$-planes through the origin in the vector space $A$.
Given $F \in G(\ell, A)$, define the Schubert variety $S^k_F := \{E \in G(k, A^*) \mid E \subset F^\perp\}$.

Lemma 4.2. Let $b = (a_1, \dots, a_{n^2})$ be a basis of $A = N^* \otimes L$. For each $I \subset [n^2]$ of cardinality $4p + 1$,
let $F_I = \langle a_{i_1}, \dots, a_{i_{4p+1}} \rangle \in G(4p + 1, A)$. Let $S_b = \cup_I S^{2p+1}_{F_I} \subset G(2p + 1, A^*)$.

Let $M := \mathrm{Id}_L \otimes \mathrm{Id}_N \in A \otimes N \otimes L^*$. Consider the polynomial $Q$ on $G(2p + 1, A^*)$ given by, for
$A' \in G(2p + 1, A^*)$, the determinant of the linear map

$$(M|_{A' \otimes L \otimes N^*})^{\wedge p}_{A'} : \Lambda^p A' \otimes L \to \Lambda^{p+1} A' \otimes N.$$

Then for all bases $b$, $Q|_{S_b}$ is not identically zero.

As described above, a choice of a general $\alpha \in A^*$ enables one to translate the vanishing of
$Q$ to the vanishing of a polynomial $Q_0$ on $G(2p, \mathfrak{sl}(L))$.

To make the connection with Bläser's work more transparent, Lemma 4.2 may be rephrased
as follows:

Lemma 4.3. Let $A = N^* \otimes L$, where $l = n$. Given any basis of $A$, there exists a subset of at least
$n^2 - (4p + 1)n$ basis vectors, and elements $\alpha^0, \alpha^1, \dots, \alpha^{2p}$ of $A^*$, such that

(1) $\alpha^0$ is of maximal rank, and thus may be used to identify $L \cong N$ and $A$ as a space of
endomorphisms. (I.e., in bases $\alpha^0$ is the identity matrix.)

(2) Choosing a basis of $L$, so the $\alpha^j$ become $n \times n$ matrices, the size $2pn$ block matrix whose
$(i,j)$-th block is $[\alpha^i, \alpha^j]$ has nonzero determinant, and

(3) The subset of $n^2 - (4p + 1)n$ basis vectors annihilate $\alpha^0, \alpha^1, \dots, \alpha^{2p}$.

Proof. Assume a basis $b$ has been given. By Lemma 4.1 with $P = \det_n$, we may find a subset
$S_1$ of at most $n$ elements of our basis of $A$ with some $\alpha^0 \in \mathrm{Span}(S_1)$ with $\det_n(\alpha^0) \neq 0$. Use
$\alpha^0 : L \to N$ to identify $N \cong L$, which enables us to now consider $A = \mathfrak{gl}(L)$ as an algebra with $\alpha^0$
playing the role of the identity element.

Let $v_{1,0}, \dots, v_{2p,0} \in A$ be linearly independent and not equal to any of the given basis vectors.
Let $E_0 = \langle v_{1,0}, \dots, v_{2p,0} \rangle$. Work locally on an affine open subset $\mathcal{A} \subset G(2p, \mathfrak{sl}(L)) = G(2p, A^*/\langle \alpha^0 \rangle)$
about $E_0$. Extend $v_{1,0}, \dots, v_{2p,0}$ to a basis $v_{1,0}, \dots, v_{2p,0}, w_1, \dots, w_{n^2-2p-1}$ of $\mathfrak{sl}(L)$. Take local
coordinates $(f_s^\mu)$, $1 \leq s \leq 2p$, $1 \leq \mu \leq n^2 - 2p - 1$, on $\mathcal{A} \subset G(2p, \mathfrak{sl}(L))$ by writing $v_s = v_{s,0} + f_s^\mu w_\mu$.
That is, $E(f) \in \mathcal{A}$ is $E(f_s^\mu) = \langle v_{1,0} + f_1^\mu w_\mu, \dots, v_{2p,0} + f_{2p}^\mu w_\mu \rangle$.

By [5, §5], $Q_0$ is not identically zero on $G(2p, \mathfrak{sl}(L))$ and therefore it is not identically zero
on $\mathcal{A}$. By the discussion in §3, up to scale, the polynomial $Q_0$ is $\det([v_s, v_t])$. The entries of $v_s$
are linear in the $f_s^\mu$, so the entries of the matrix $([v_s, v_t])$ are quadratic in the $f_s^\mu$. Thus $Q_0$ is a
polynomial of degree $4pn$ in the $f_s^\mu$.

Applying Lemma 4.1 with $P = Q_0$, and recalling that we have already used up to $n$ basis
elements to obtain $\alpha^0$, gives $(4p + 1)n$ basis elements such that $Q_0$ restricted to their span is not
identically zero. □

Remark 4.4. The only property of $Q_0$ that was used in the proof was its degree. It may be
possible to improve the lower order terms in Theorem 1.1 by taking properties of $Q_0$ into
account. For example, Bläser's theorem is the case $p = 1$. There the polynomial

$$\det \begin{pmatrix} 0 & [\alpha^1, \alpha^2] \\ -[\alpha^1, \alpha^2] & 0 \end{pmatrix}$$

is a square, and one may work instead with $\det([\alpha^1, \alpha^2])$, which explains the improvement in
the lower order terms in the case $p = 1$.

Proof of Theorem 1.2. Let $\phi$ be an expression of $M_{\langle n,n,m\rangle}$ as a sum of $r$ rank one tensors,
and assume $r$ to be minimal. Since $\mathrm{Lker}(M_{\langle n,n,m\rangle}) = 0$ (i.e., $\forall \alpha \in A^* \setminus 0$, $\exists \beta \in B^*$ such that
$M(\alpha, \beta) \neq 0$) we may write $\phi = \psi_1 + \psi_2$ with $R(\psi_1) = n^2$, $R(\psi_2) = r - n^2$ and $\mathrm{Lker}(\psi_1) = 0$.
Now consider the $n^2$ elements of $A^*$ appearing in $\psi_1$. Since they span $A^*$, by Lemma 4.3 we
may choose a subset of $n^2 - (4p + 1)n$ of them that annihilate a maximal rank element $\alpha^0$ and
some $\alpha^1, \dots, \alpha^{2p}$ such that, choosing bases, the determinant of the matrix $([\alpha^i, \alpha^j])$ is nonzero.
Let $\phi_1$ denote the sum of all monomials in $\psi_1$ whose $A^*$ terms annihilate $\alpha^0, \alpha^1, \dots, \alpha^{2p}$, so
$R(\phi_1) \geq n^2 - (4p + 1)n$. Let $\phi_2 = \psi_1 - \phi_1 + \psi_2$.

Now $R(\phi_2) \geq nm \frac{2p+1}{p+1}$ because the determinant of the linear map $(M_{\langle n,n,m\rangle}|_{A' \otimes B^* \otimes C^*})^{\wedge p}_{A'} : \Lambda^p A' \otimes B^* \to
\Lambda^{p+1} A' \otimes C$ is the determinant of the linear map $(M|_{A' \otimes L \otimes N^*})^{\wedge p}_{A'}$ raised to the $m$-th power, and the
determinant of the second map is nonzero by the argument in [5, §5]. Finally $R(M_{\langle n,n,m\rangle}) =
R(\phi_1) + R(\phi_2)$. □
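
The bookkeeping in the last step can be checked mechanically. The sketch below is illustrative, with hypothetical function names, and it relies on reading the dimension count of §3 as follows: a map of full rank $\binom{2p+1}{p} nm$ whose rank is at most $r\binom{2p}{p}$ forces $r \geq nm\binom{2p+1}{p}/\binom{2p}{p}$. The code confirms that this ratio equals $nm\frac{2p+1}{p+1}$ and that adding $n^2 - (4p+1)n$ gives the bound of Theorem 1.2.

```python
from fractions import Fraction
from math import comb

def flattening_term(n, m, p):
    """nm * C(2p+1, p) / C(2p, p), the lower bound on R(phi_2) under the reading above."""
    return Fraction(n * m * comb(2 * p + 1, p), comb(2 * p, p))

def annihilator_term(n, p):
    """n^2 - (4p+1) n, the lower bound on R(phi_1) from Lemma 4.3."""
    return n * n - (4 * p + 1) * n

def theorem_1_2_bound(n, m, p):
    """Right-hand side of Theorem 1.2: (1 + p/(p+1)) nm + n^2 - (4p+1) n."""
    return (1 + Fraction(p, p + 1)) * n * m + n * n - (4 * p + 1) * n

def identities_hold(max_n, max_m):
    """Check both identities for all 1 <= p < n <= max_n and 1 <= m <= max_m."""
    for n in range(2, max_n + 1):
        for m in range(1, max_m + 1):
            for p in range(1, n):
                if flattening_term(n, m, p) != Fraction(n * m * (2 * p + 1), p + 1):
                    return False
                total = annihilator_term(n, p) + flattening_term(n, m, p)
                if total != theorem_1_2_bound(n, m, p):
                    return False
    return True

print(identities_hold(12, 12))  # prints True
```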

References 

1. Markus Bläser, A $\frac{5}{2}n^2$-lower bound for the rank of $n \times n$-matrix multiplication over arbitrary fields, 40th Annual
Symposium on Foundations of Computer Science (New York, 1999), IEEE Computer Soc., Los Alamitos, CA,
1999, pp. 45-50. MR MR1916183

2. Markus Bläser, On the complexity of the multiplication of matrices of small formats, J. Complexity 19 (2003), no. 1,
43-60. MR MR1951322 (2003k:68040)

3. Peter Bürgisser, Michael Clausen, and M. Amin Shokrollahi, Algebraic complexity theory, Grundlehren der
Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 315, Springer-Verlag,
Berlin, 1997, With the collaboration of Thomas Lickteig. MR 99c:68002

4. J. M. Landsberg, Tensors: geometry and applications, Graduate Studies in Mathematics, vol. 128, American
Mathematical Society, Providence, RI, 2012. MR 2865915

5. J. M. Landsberg and Giorgio Ottaviani, New lower bounds for the border rank of matrix multiplication, preprint,
arXiv:1112.6007.

6. Giorgio Ottaviani, Symplectic bundles on the plane, secant varieties and Lüroth quartics revisited, Vector
Bundles and Low Codimensional Subvarieties: State of the Art and Recent Developments (G. Casnati,
F. Catanese, and R. Notari, eds.), Quaderni di Matematica, vol. 21, Dip. di Mat., II Univ. Napoli, 2007, pp. 315-352.

7. V. Strassen, Rank and optimal computation of generic tensors, Linear Algebra Appl. 52/53 (1983), 645-685.
MR 85b:15039

E-mail address: jml@math.tamu.edu 



